Reducing the Learning Time of Reinforcement Learning for the Supervisory Control of Discrete Event Systems

نویسندگان

چکیده

Reinforcement learning (RL) can obtain the supervisory controller for discrete-event systems modeled by finite automata and temporal logic. The published methods often have two limitations. First, a large number of training data are required to learn RL controller. Second, algorithms do not consider uncontrollable events, which essential control theory (SCT). To address limitations, we first apply SCT find supervisors specifications automata. These remove illegal violating these hence reduce exploration space algorithm. For remaining logic, algorithm is applied search optimal decision within confined space. Uncontrollable events considered as uncertainties in plant model. proposed method nonblocking supervisor all with less time than methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Partial Observation in Distributed Supervisory Control of Discrete-Event Systems

Distributed supervisory control is a method to synthesize local controllers in discrete-eventsystems with a systematic observation of the plant. Some works were reported on extending this methodby which local controllers are constructed so that observation properties are preserved from monolithic todistributed supervisory control, in an up-down approach. In this paper, we find circumstances in ...

متن کامل

the effects of integrating cooperative learning into vocabulary learning of elementary school students

the purpose of the research is to examine if integrating cooperative learning into vocabulary learning helps to increase word recognition of students in an elementary school in iran. it tries to investigate whether cooperative learning approach enables students to improve their language learning. this research used stad (students team achievement division) as a cooperative model in this study. ...

15 صفحه اول

the relationship between locus of control and iranian efl university students’ beliefs about language learning

this exploratory study aimed to investigate a possible relationship between learners’ beliefs about language learning and one of their personality traits; that is,locus of control (loc). both variables, beliefs and locus of control, are assumed to influence the language learning process. the internal control index (ici) and the beliefs about language learning inventory (balli) were administered...

the effect of learning strategies on the speaking ability of iranian students in the context of language institutes

the effect of learning strategies on the speaking ability of iranian students in the context of language institutes abstract language learning strategies are of the most important factors that help language learners to learn a foreign language and how they can deal with the four language skills specifically speaking skill effectively. acknowledging the great impact of learning strategies...

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3285432